Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof
نویسندگان
چکیده
This article presents the data collected and ASR systems developped for 4 sub-saharan african languages (Swahili, Hausa, Amharic and Wolof). To illustrate our methodology, the focus is made on Wolof (a very under-resourced language) for which we designed the first ASR system ever built in this language. All data and scripts are available online on our github repository.
منابع مشابه
Speed Perturbation and Vowel Duration Modeling for ASR in Hausa and Wolof Languages
Automatic Speech Recognition (ASR) for (under-resourced) Sub-Saharan African languages faces several challenges: small amount of transcribed speech, written language normalization issues, few text resources available for language modeling, as well as specific features (tones, morphology, etc.) that need to be taken into account seriously to optimize ASR performance. This paper tries to address ...
متن کاملAutomatic Speech Recognition Using Probabilistic Transcriptions in Swahili, Amharic, and Dinka
In this study, we develop automatic speech recognition systems for three sub-Saharan African languages using probabilistic transcriptions collected from crowd workers who neither speak nor have any familiarity with the African languages. The three African languages in consideration are Swahili, Amharic, and Dinka. There is a language mismatch in this scenario. More specifically, utterances spok...
متن کاملIs the Role of Physicians Really Evolving Due to Non-physician Clinicians Predominance in Staff Makeup in Sub-Saharan African Health Systems?; Comment on “Non-physician Clinicians in Sub-Saharan Africa and the Evolving Role of Physicians”
Health workforce shortages in Sub-Saharan Africa are widely recognized, particularly of physicians, leading the training and deployment of Non-physician clinicians (NPCs). The paper by Eyal et al provides interesting and legitimate viewpoints on evolving role of physicians in context of decisive increase of NPCss in Sub-Saharan Africa. Certainly, in short or mid-term, NPCs will continue to be a...
متن کاملMachine Assisted Analysis of Vowel Length Contrasts in Wolof
Growing digital archives and improving algorithms for automatic analysis of text and speech create new research opportunities for fundamental research in phonetics. Such empirical approaches allow statistical evaluation of a much larger set of hypothesis about phonetic variation and its conditioning factors (among them geographical / dialectal variants). This paper illustrates this vision and p...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016